Construction of FuzzyFind Dictionary using Golay Coding Transformation for Searching Applications

نویسندگان

  • Kamran Kowsari
  • Maryam Yammahi
  • Nima Bari
  • Roman Vichr
  • Faisal Alsaby
  • Simon Y. Berkovich
چکیده

searching through a large volume of data is very critical for companies, scientists, and searching engines applications due to time complexity and memory complexity. In this paper, a new technique of generating FuzzyFind Dictionary for text mining was introduced. We simply mapped the 23 bits of the English alphabet into a FuzzyFind Dictionary or more than 23 bits by using more FuzzyFind Dictionary, and reflecting the presence or absence of particular letters. This representation preserves closeness of word distortions in terms of closeness of the created binary vectors within Hamming distance of 2 deviations. This paper talks about the Golay Coding Transformation Hash Table and how it can be used on a FuzzyFind Dictionary as a new technology for using in searching through big data. This method is introduced by linear time complexity for generating the dictionary and constant time complexity to access the data and update by new data sets, also updating for new data sets is linear time depends on new data points. This technique is based on searching only for letters of English that each segment has 23 bits, and also we have more than 23-bit and also it could work with more segments as reference table. Keywords—FuzzyFind Dictionary; Golay Code; Golay Code Transformation Hash Table; Unsupervised learning; Fuzzy search engine; Big Data; Approximate search; Informational Retrieval; Pigeonhole Principle; Learning Algorithms ; Data Structure

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Weighted Unsupervised Learning for 3D Object Detection

searching through a large volume of data is very critical for companies, scientists, and searching engines applications due to time complexity and memory complexity. In this paper, a new technique of generating FuzzyFind Dictionary for text mining was introduced. We simply mapped the 23 bits of the English alphabet into a FuzzyFind Dictionary or more than 23 bits by using more FuzzyFind Diction...

متن کامل

A Novel Image Denoising Method Based on Incoherent Dictionary Learning and Domain Adaptation Technique

In this paper, a new method for image denoising based on incoherent dictionary learning and domain transfer technique is proposed. The idea of using sparse representation concept is one of the most interesting areas for researchers. The goal of sparse coding is to approximately model the input data as a weighted linear combination of a small number of basis vectors. Two characteristics should b...

متن کامل

A New Dictionary Construction Method in Sparse Representation Techniques for Target Detection in Hyperspectral Imagery

Hyperspectral data in Remote Sensing which have been gathered with efficient spectral resolution (about 10 nanometer) contain a plethora of spectral bands (roughly 200 bands). Since precious information about the spectral features of target materials can be extracted from these data, they have been used exclusively in hyperspectral target detection. One of the problem associated with the detect...

متن کامل

Construction of non-square M-QAM sequences with low PMEPR for OFDM systems - Communications, IEE Proceedings-

The author considers the use of coding to reduce the peak-to-mean envelope power ratio (PMEPR) for orthogonal frequency division multiplexing (OFDM) systems. Most of the existing schemes that use coding for PMEPR reduction assume a PSK constellation. The author presents the construction of non-square M-QAM symbols from a combination of QPSK and BPSK signals when M=2 and n is an odd number. By u...

متن کامل

Construction of 16-QAM OFDM Codes with Reduced Peak to Average Power Ratio using Golay Complementary Sequences

OFDM is a powerful multicarrier transmission technique used extensively for wireless applications. High PAPR is one of the deleterious problems of OFDM. This paper reviews basic OFDM system and PAPR problem associated with it. Also, the construction of M-QAM particularly 16-QAM sequences using QPSK Golay sequences over 4  is conferred. It is elucidated that a 16-QAM constellation can be writte...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1503.06483  شماره 

صفحات  -

تاریخ انتشار 2015